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Abstract. Numerical results on the translocation of long biopolymers through mid- 
sized and wide pores are presented. The simulations are based on a novel methodology 
which couples molecular motion to a mesoscopic fluid solvent. Thousands of events of 
long polymers (up to 8000 monomers) are monitored as they pass through nanopores. 
Comparison between the different pore sizes shows that wide pores can host a 
larger number of multiple biopolymer segments, as compared to smaller pores. The 
simulations provide clear evidence of folding quantization in the translocation process 
as the biopolymers undertake multi-folded configurations, characterized by a well- 
defined integer number of folds. Accordingly, the translocation time is no longer 
represented by a single-exponent power law dependence on the length, as it is the case 
for single-file translocation through narrow pores. The folding quantization increases 
with the biopolymer length, while the rate of translocated beads at each time step is 
linearly correlated to the number of resident beads in the pore. Finally, analysis of the 
statistics over the translocation work unravels the importance of the hydrodynamic 
interactions in the process. 
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1. Introduction 

Biological systems exhibit a complexity and diversity far richer than the simple solid 
or fluid systems traditionally studied in physics or chemistry. Advances in computer 
technology and breakthroughs in computational methods have been constantly reducing 
the gap between quantitative models and actual biological behavior. The main challenge 
remains the wide range of spatio-temporal scales involved in the dynamical evolution of 
complex biological systems. In response to this challenge, various strategies have been 
developed recently, based on composite computational schemes in which information 
is exchanged between the scales. Motivated by recent experimental studies [TJ E], 
we apply such a computational scheme to the translocation of a biopolymer through 
nanopores. The translocation of biopolymers plays a major role in many important 
biological processes, such as viral infection by phages, inter-bacterial DNA transduction, 
and gene therapy j3]. The ultimate goal of these studies is to open a path for ultra- 
fast DNA-sequencing by sensing the base-sensitive electronic signal as the biopolymer 
passes through a nanopore with attached electrodes. The importance of this process has 
spawned a number of in vitro experiments, aimed at exploring the translocation process 
through micro-fabricated channels [3] under the effects of an external electric field, or 
through protein channels across cellular membranes [5j [6]. From a theoretical point of 
view, simplified schemes [7] E], coarse-grained or microscopic models with and without 
hydrodynamic interactions [9j [TO], [TTJ or mesoscopic approaches [12] are able to analyze 
universal features of the translocation process. However, a quantitative description of 
this complex process, which involves the competition between many-body interactions 
at the atomic or molecular scale, fluid-atom hydrodynamic coupling, as well as the 
interaction of the biopolymer with wall molecules in the region of the pore, calls for 
state-of-the art modeling, towards which the results presented here are directed. 

In a recent paper, the translocation of biopolymers through (relatively) large pores 
was reported to exhibit the intriguing phenomenon of current-blockade quantization [Tj. 
This was interpreted as an indirect evidence that the polymer crosses the pore in the 
form of "quantized" configurations, associated with integer values of the folding number, 
the number of strands simultaneously occupying the pore during the translocation. 
Such a behavior has been recently confirmed by the direct observation of multi-folded 
configurations in large-scale numerical simulations of biopolymer translocation by the 
present authors [13]. Here, we considerably extend the scope of such simulations, by 
increasing the polymer lengths up to 8000 monomers, about an order of magnitude above 
any previous simulation in the field. This unexplored regime of polymer lengths reveals 
an extremely rich configurational dynamics, especially in larger pores. In this work, 
we elaborate more on this enriched dynamics and analyze in detail the phenomenon of 
folding quantization. 
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2. Multiscale scheme 

Our simulations are based on a multiscale methodology [14] . which involves the coupling 
of a mesoscopic lattice Boltzmann (LB) [15] approach for the solvent degrees of freedom 
and molecular dynamics (MD) for the biopolymer motion. The comparison of our 
previous results to those of experiments of DNA translocation, allows us to map the 
anonymous simulated polymers to actual biopolymers [IE] . A three-dimensional box of 
size (N x xNyX N z )Ax lattice units, with Ax the spacing between lattice points surrounds 
both the solvent and the polymer. We take N x = 2N y , N y = N z and N x = 128 and 
biopolymers with iVo = 100 — 8000 beads, spanning nearly two orders of magnitude in 
polymer length. At t = the polymer resides entirely in the right chamber at x > ^ Ax. 
A separating wall is located in the mid-section of the x direction, at x = ^Ax. At the 
center of this wall, a cylindrical pore of length l p = 3Ax and diameter d p is opened up. 
We have used two different pore diameters, a small d p — 5 and wide d p = 9 in units 
of Ax. Translocation is induced by a constant electric field, localized around the pore, 
similarly to the experimental settings [2], acting along the x direction and confined to 
a cylindrical channel of the same size as the pore and length 3Ax along the streamwise 
(x) direction. All parameters are measured in units of the LB time step and spacing, At 
and Ax, respectively, which are both set equal to 1. The MD time step is 5 times smaller 
than At. With the pulling force associated with the electric field in the experiments set 
at 0.02 and the temperature at 10~ 4 , the process falls in the fast translocation regime. 

The monomers interact through a Lennard- Jones 6-12 potential with parameters 
a = 1.8, e = 10 -4 and a cut-off at 2.02. The interaction of the monomers with the wall 
is modeled by a Lennard- Jones 6-12 potential with parameters a w = 1.5, e w = 10~ 3 and 
a cut-off at 1.68. Accordingly, the effective width and radius of the surrounding pore 
should take into account the repulsive monomer-wall interactions, so that a monomer 
is counted as being inside the pore if contained in a cylinder of effective width ~ 6Ax 
and radius ~ 3.5 and 7.5 for d p = 5 and 9, respectively. The bonds between adjacent 
beads are modeled through springs, with a spring constant k = 0.5 and an effective 
equilibrium bond length b = 1.2. The solvent has density p = 1, kinematic viscosity 
v — 0.1 and damping coefficient with the embedded particles 7 = 0.1. Further details 
can be found in Ref. [II]. One lattice spacing Ax = 42 nm, so that the bond length 
b = 1.2 Ax corresponds to the persistence length of double-stranded DNA (50nm). In 
measuring the residence number of beads in the pore region we have defined a cylinder 
of length 10 Ax and radius d p centered at the pore midpoint and with its axis aligned 
with the pore axis. 

3. Configurational analysis 

The ensemble of simulations is generated by different realizations of the initial polymer 
configuration, to account for the statistical nature of the process. The combined 
statistics over initial conditions and time evolution of the simulations, delivers an 
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Figure 1. Snapshots of a biopolymer (Nq = 4000) translocating through a wide pore 
(dp = 9). Different folding conformations are visible. A force (not shown) is applied 
at the pore region and initiates translocation from right to left. In order to reveal the 
polymer conformation as it passes though the pore, the wall is shown thinner than it 
actually is in the simulations. 



aggregate ensemble ranging from 500, 000 events for the shortest history (N Q = 100, 
dp = 9) up to nearly 10 million events for the longest one (N = 8000, d p = 5). 
Fig. [1] shows different snapshots of such a translocation event for a N = 4000 long 
biopolymer. In the initial stage of the translocation, the nanopore gets populated, 
with the biopolymer undertaking a high-fold conformation as it passes through the 
pore. The range of the number of folds explored by the translocation trajectories grows 
approximately with the cross-section of the pore and the polymer length. 

The observed translocation time results from a weighted average over a whole set of 
multi-folded configurations attained by the polymer during translocation, The weights in 
this average depend on the number of folds a polymer of a given length undertakes within 
the pore, and correspond to the probability distribution function (number of counts 
normalized to unity) shown in Fig. HI Within this picture, a single scaling exponent 
characterizing the translocation time as a function of the polymer length, appears to be 
insufficient because, above a given length, many folded-states are simultaneously excited, 
as their number is an increasing function of the pore diameter. The translocation time 
of these multi-folded configurations is shown to be dominated by the low-folded states, 
which explains the relatively minor deviations of the translocating time from the single- 
file power-law expression r ~ Nq 27 [2j, where r and N are the translocation time and 
the length of the biopolymer, respectively. In order to visualize this behavior, in Fig. 
[21 we report the translocation time as a function of the polymer length, N , for both 
pores and all events for each length. The figure shows that, up to a length of iVo = 1000 
for the wide (d p = 9) and Nq = 2000 for the narrower (d p = 5) pore, the most probable 
translocation time r obeys a scaling law of the form r ~ Nq, with a ~ 1.31 and 
a ~ 1.36, respectively. These values are slightly larger than the corresponding values 
for narrow nanopores [21 [14"] . 

This feature does not occur only due to statistical uncertainities, but may also 
depend on the pore width. Polymers translocating across a pore with q > 1, where q 
denotes a g-folded configuration, possibly enhance the resistance to the electric drive, 
due to the additional energy needed to keep them folded against internal flexional- 
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Figure 2. Scatter plots of all events for biopolymers translocating through the wide 
pore (d p — 9). The two different exponents, 1.36 and 0.75 (see text) are denoted by 
the two lines. The inset shows similar events through a narrower pore (d p — 5). Again, 
the two lines imply the two exponents 1.31 and 0.70. 



relaxation forces. These forces tend to un-fold the polymer, increasing its interaction 
with the walls, which may slow down the process as compared with the single-strand 
translocation pm [HI Ej. By restricting the analysis to the longest chains, iVo > 1000 
(iV > 2000) for the wide (narrow) pore, the bending of the curve could be interpreted 
as the emergence of a new scaling exponent, a2 ~ 0.75 (0.70). Evidently, there are 
not enough data points to support the exact values for the scaling exponents, but the 
significant decrease of these exponents for long biopolymers is qualitatively apparent. 
In Fig. [2J the two different exponents are denoted by the grey lines (with a different 
prefactor in each case in order to match the scaling law to the simulation data). We 
have found that this bending is due to the multi-fold conformation of the translocating 
biopolymer, which does not necessarily follow a power-law dependence on the polymer 
size. As a result, in contrast to single-file translocation (the case for narrow pores), 
multi-fold translocation does not need to obey a standard power-law scaling. 

An insightful way to monitor the translocation process is through the evaluation of 
the fraction of translocating beads with time. Averages over all events for all lengths 
are plotted in Fig. [3], where both the number of the translocated beads and the time 
are presented in reduced units, i.e. scaled with respect to the total number of beads and 
the total translocation time for each event and length. In such units, universality would 
result in a collapse of the translocation data at all lenghts into a single master curve. 

Fig. [3] shows a clear trend with size, which only very short chains (iVo = 100) do not 
follow. Initially, the shorter biopolymers exhibit a larger translocation speed and also 
a larger acceleration, but at the final stages this trend is reversed, as the longer chains 
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Figure 3. Scaled number of translocated beads with time for all lengths (No = 
100 — 8000). The pore diameter is d p = 9 and both axes are scaled with respect to the 
total number of beads and the total translocation time, respectively. The black dashed 
line corresponds to a constant translocation-speed, dN(t)/dt = Nq/t. The vertical 
dotted line shows the crossover discussed in the text. 

accelerate and eventually translocate faster through the pore. The crossover, where the 
translocation speed of the long biopolymers becomes larger than the corresponding of 
the shorter ones occurs at 0.7 Nq (see dotted line in Figj3]), a point at which 70% of the 
chain has already translocated. It is interesting to observe that this is a universal value 
for all lengths studied here, although at this point, this is a purely observational fact. 
Again, only polymers with iVo = 100 do not follow this trend, as these are too short. 
For all times, the beads follow a super-linear trajectory compared to the constant speed 
translocation (dashed line in the figure), but at the final stages the end part of the 
polymers translocate with constant speed. In the case of the narrower pore (d p = 5), 
the trend is similar, although no universal crossover close to the constant speed limit is 
observed. 

3.1. Quantization of the folding number 

In order to investigate the quantization of the resident beads in the pore, we monitored 
the distribution of the number of pore-resident beads iV r with the number q for a q- 
folded translocation. The resident monomers block the current across the channel, so 
that iV r conveys a direct measure of the current drop associated with the biopolymer 
passage through the nanopore. All distributions are peaked around quantized values 
of q (as defined below). A large fraction of the events are around q = 1, hence single- 
file translocation, which is also the conformation at the late stages of the process for 



7 




I I I I 1 I I I I I I i I i I i LI i I i I i I i I i I LL_ 

12345678902468 10 12 14 16 18 

folding number q 



Figure 4. Probability distribution of the folding number q for the entire set of polymer 
lengths, Nq = 100 — 8000, and both pore diameters (a) d p = 5 and (b) d p = 9. 

all lengths. In Fig. HI the cumulative statistics of the folding number, q = N r /N\, 
are shown as collected at each time-step of every single trajectory for a series of 100 
realizations for each polymer length. Here, N\ ~ 7 is the single-file value of the resident 
number, as also confirmed by visual inspection of the configurations. 

The wide pore (d p = 9) reveals a sharp quantization of the distribution of the 
folding number, with a very well defined peak in the distribution. Simulation data 
show, that the range of g-numbers, occupied in this case grows up to q ~ 10. The 
average q over all realizations for each length remains approximately constant ~ 1.3, 
up to iVo < 1000, and grows up to about 2.6 for N Q = 8000. Accordingly, the departure 
of the translocation time from a power-law at large No is mainly due to the increase 
of the average q with the polymer length (for N > 1000). This results from the shift 
of the probability distribution of the translocation time towards higher g-folds as Nq 
increases. For all lengths inspected, the time average of the folding number q remains 
below 3, because the states q = 1 and q = 2 continue to be the most populated ones. 

The narrower pore (d p = 5) shows the same trends, although on a smaller range 
of g-numbers, up to q ~ 6. Above this value, the quantized peaked structure in the 
folding probability is lost. For this pore, the average q over all realizations for each 
length remains close to 1.3, up to N < 2000, and grows up to about 2.1 for N = 8000. 
These data indicate that quantization of the folding number is better manifested by 
long chains crossing wide pores. The fact that the average q remains the same (~ 1.3) 
for both pores and relatively short biopolymers shows that, for these lengths, there 
is no essential effect by varying the pore size. On the contrary, for a wide pore, the 
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Figure 5. The average folding number q as a function of the polymer length for the 
wide (dp = 9) and narrower (d p = 5) pore, respectively. 



average q increases faster in the range of long biopolymers than for a narrower pore. 
It is quite natural to expect that long polymers transiting through large pores prove 
capable of supporting a widely richer spectrum of folded conformations as compared 
to the case of short polymers transiting through narrow pores. Graphical evidence for 
these points is given in Fig. [5l where the average folding number q for all cases studied is 
plotted as a function of the length iV . Evidently, the average q is almost constant up to 
Nq = 1000 — 2000 and increases thereafter, exposing the difference between the two pore 
sizes, which remains nonetheless rather mild. The highest-folding number supported by 
the polymer-pore system should increase quadratically with the pore diameter, as the 
number of monomers a given pore can accomodate is clearly proportional to the cross 
section of the pore. 



4. Forces influencing the translocation process 

The translocation dynamics depends on the strength of the frictional forces exerted by 
the wall. In single-file translocation, it is well known that strong friction changes the 
power-law exponent from ~ 1.2 to a linear dependence r oc Nq [IE]. In the case of multi- 
file translocation, it is unclear whether the high-fold configurations induce high-friction 
conditions, that would decrease the rate of the translocated beads (N — ^) with the 
resident monomers (N r ) for high values of the folding number q. We have observed 
that N is linearly correlated to iV r with basically the same average slope for all folds. 
Therefore, frictional forces affect a small layer close to the wall, and have little effect 
on the group of monomers translocating in the inner region of the pore. This seems 
to rule out the possibility that the change of exponent be caused by the pore frictional 
forces. Supporting data from about 100 events for the longest biopolymer (Nq = 8000) 
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Figure 6. Scatter plot of the translocation work: (top panel) the work for the driving 
field and (low panel) the work for the hydrodynamic field as functions of length N for 
translocation through the wide pore (d p = 9). 



translocating through the wide pore (d p = 9) reveal that all events are centered around 
integer values of the folding number q. As the characteristic number q increases, the 
rate of the translocating monomers with time increases linearly. Inspection of the results 
for all lengths (No = 100 — 8000) and both pore lengths (d p = 9 and d p = 5), shows 
that (a) for the same pore, the slope of the q-N curve increases with increasing chain 
length (long polymers slightly slow-down), and (b) for the same polymer length, the 
linear correlation between q and N is better manifested for wider pores. 

Another set of forces, which greatly affect the translocation are the fiuid-biopolymer 
interactions and the driving force at the pore region. The simulations reveal that solvent 
correlated motion makes a substantial contribution to the translocation energetics. The 
role of hydrodynamic correlations is best highlighted by computing the work done by the 
moving fluid on the polymer, Wh, over the entire translocation process as compared to 
the case of a passive fluid at rest. Inspection of all events for all polymers translocating 
through wide pores, shown in Fig. El reveals that the cooperation of the surrounding 
solvent and the solute monomers is larger as the length N Q of the polymer increases. 
The work of the driving force, We, (always positive), varies only slightly with the length 
N , as compared to the corresponding behavior of the hydrodynamic work. A rough 
estimate shows that, as the length increases from No = 100 to iVo = 8000, there is an 
increase of about 150% in Wh, while the corresponding increase in We is less than 10%. 
(For these estimates, we used the average values of Wh and We for each length N ). For 
a narrower pore (d p = 5), these effects are still visible, though not as strong revealing a 
higher cooperativity of the solvent during translocation through wide pores. 
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5. Summary 

In summary, multi-scale simulations of the translocation of long polymers, consisting of 
up to 8000 beads, across wide nanopores, capable of hosting multi-file configurations, 
provide clear evidence of a sharp quantization of the translocation process. Throughout 
their translocation, the polymers undertake multi-folded configurations, associated with 
well-defined integer folding numbers. The observed translocation time reveals a bending 
of the scaling law with the biopolymer length, which gives rise to two different exponents, 
one of them describing the short biopolymers and the other the longer ones. The longer 
the polymer and wider the pore, the quantization of conformational folds is more evident 
and is accompanied by an enhancement of the synergistic role of the hydrodynamic field. 
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